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Abstract: We present a procedure that determines the law of a random walk in an iid random 
environment as a function of a single "typical" trajectory. We indicate when the trajectory charac- 
terizes the law of the environment, and we say how this law can be determined. We then show how 
independent trajectories having the distribution of the original walk can be generated as functions 
of the single observed trajectory. 

1. Introduction 

Suppose you are given a "typical" trajectory of a random walk in an iid random envi- 
ronment. Can you say what the law of the environment is on the basis of the information 
supplied by this single trajectory? Can you determine the law of the walk? Such questions 
may arise if one intends to use the random environment model in applications. 

These questions are essentially pointless if the group is finite (in which case the environ- 
ment at each of the finitely many sites that happen to be visited infinitely many times can 
of course be determined, but it is hard to say much more). So we assume that the group is 
infinite, and we go a little further: we assume that the (random) set of sites visited by the 
walk is almost surely infinite. (See remark 5.1.) 

Questions of this kind have been studied in the context of random walks in random 
scenery by Benjamini and Kesten pP, Lowe and Matzinger [3], and Matzinger [5]- 

In the case of an iid random environment, the information furnished by a single ( "typical" ) 
trajectory tells us whether the walk is recurrent; indeed, one can show that one of the events 
{each visited site is visited infinitely many times}, {no site is visited infinitely many times} 
is an almost sure event. (Cf Kalikow [3].) 

Now, if the walk is recurrent, the problem is quite simple: we can know much more than 
the law of the environment, because we find the environment itself at each visited site, 
which is given by the frequency of each possible jump from this site. In the transient case, 
the "naive" approach consisting of doing statistics on sites which have been visited many 
times cannot be utilized directly, since the assumption of being at a site which has been 
frequently visited introduces a bias on the environment at that site, which should encourage 
jumps to sites from which it is be easier to come back (loosely speaking, close sites). 

We present a procedure that eliminates any source of bias, collecting information on 
sites displaying some specified "histories". Each such "history" which can be encountered 
is almost surely encountered infinitely often (Proposition 3). This is combined with an 
interpretation of the process as an edge-oriented reinforced random walk (cf Enriquez and 
Sabot [2]), allowing us to find the exact law of the process. Now, there may exist "bad" 
transitions: if the walk jumps from a site along a "bad" transition, it will never get back to 
that site. If the set of these "bad" transitions is empty or has just one element, we can find 
the distribution of the environment (Theorem 1). 
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Finally, we show how countably many independent trajectories having the distribution of 
the original walk can be generated by concatenating steps of the observed trajectory. The 
algorithm is purely "mechanical": it does not imply any computation, and, in particular, 
the knowledge of the law of the walk (or of the environment) is not needed. 

2. Framework and notations 

Our "canonical" process X := (X n ) n >o walks on a group G. We denote by (J r n )n>o the 
natural filtration of X. 

We assume that the group G is Abelian, although this is never used in our arguments. 
Its only utility is in the possibility of writing things like x — y = e or x = y + e indifferently. 

We use the additive notation, and the identity element of G is denoted by 0. 

We assume moreover that the group G is countable. This assumption can be dispensed 
with— see remark 5.2— but we feel that it renders the reading easier and the discussion more 
tractable. It does not affect the core of the argument. 

We denote by N the set {1, 2, . . . }. 

2.1. Random walks in random environment 

We denote by V the set of non-negative families p := (p e )eeG such that ^2 e& QPe = 1- 
The environment at the site x, v{x) := (v(x, e)) ee <3, is a random element of V . We assume 
that the environments at sites are iid V- valued random variables with common distribution 
A*- 

We let v e := 2/(0, e). 

The random environment v := [v{x)) x ^g is a random element of V G , and it is governed 
by the probability measure pL® . 

For all 7r = ((vr(x, e)) e& c)x£G 6 V G ^ let Q w be the probability measure under which 
X is a G-valued Markov chain started at whose transition probability from x to x + e is 

ir(x, e) (x, e £ G). 

The law of the random walk in random environment (or the so-called "annealed" law) is 
the probability measure P M = J Q w //® G (<i7r) (= E[Q y ], Q v being what is usually called the 
"quenched" law). 

We recall our "infinitude assumption" according to which the (random) set of sites visited 
by the walk, S := {X n \ n > 0}, is P^-almost surely infinite. (This implies, of course, that 
the group G itself is infinite.) 

We let E denote the set of those g 6 G such that the probability of the event {v g > 0} is 
strictly positive. (It is easy to see that the random set {X n+ \ — X n \ n > 0} is P^-almost 
surely exactly E.) We then partition E into two sets, R and T, defined as follows. 

• R is the set of elements r of E that can be written as —(ex + ... + e n ) where (ej)i<j< n 
is a finite nonempty sequence of elements of E. It is easy to see that r G R if and only 
if P^(Xi = r and, for some n > 1, X n = 0) > 0; and Proposition 3 below implies that 
if r G R, then the random set {n \ X n+ \ = X n + r and, for some k > 0, X n+ k = X n } is 
P^-almost surely infinite. It is therefore quite easy to identify R when observing a single 
trajectory. 

• T is the complement of R in E. It represents the "possible" transitions which do not 
allow a return to the original site. 
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2.2. Histories 

We start with some definitions. 

Definition 1. The history of the site x at time n, which we denote by H(n,x), is the 
random finite sequence of elements of G defined by the successive moves of the process from 
the site x before time n. More formally, H(0,x) is the empty sequence (); H(n + 1, x) = 
H(n,x) if X n ^ x; and, if X n = x, then H(n + 1, a;) is the finite sequence obtained by 
adjoining X n+ \ — X n as a new rightmost term to H(n,x). 

Let us denote by (Z+)o the set of families (n g ) g ^c £ ^+ with a finite number of non 
null terms. 

Definition 2. The unordered history of the site x at time n, denoted by N(n, x) := 
(N g (n,x))g£G! is a random element of (Z5)o where, for all g € G, N g (n,x) is the random 
number of moves from the site x to x + g before time n. In other words, N g (n,x) = 

Yh=o l {X l =x,X l+1 -X l =g}- 

Also, the local unordered history at time n is the unordered history of the site X n at time 
n, N(n) := N{n,X n ). 

2.3. Reinforced random walks 

An edge-oriented reinforced random walk consists of a discrete random process whose 
transition probabilities are functions of the number of each type of move in the history of 
the process that has been made from the site currently occupied. A good point of view, 
in order to get a non-biased procedure of reconstitution of the environment, is to view the 
random walk in random environment as an edge-oriented reinforced random walk. It is the 
essence of the easy part of the result of |2j (the other part examines the conditions on a 
reinforced random walk to correspond to a RWRE). 

We introduce the reinforced random walks by the following definitions. 
Definition 3. A reinforcement function is a function 

V : (Z?) V 
n = {n g ) geG i ^ V(n) := (V e (n)) eeG 

Definition 4. We call edge-oriented reinforced random walk with reinforcement function 
V the random walk defined by the law P v on the trajectories starting at given by 

P V (X n+1 -X n = e\T n ) = V e (N(n)). 
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3.1. RWRE as an edge-oriented reinforced random walk 

We can now state the result of Enriquez and Sabot [2] : 

Proposition 1. The annealed law P M of the RWRE coincides with the law of the 

reinforced random walk whose reinforcement function V satisfies, for all e E G, 

v ' (n)= iirw^ 

whenever n € (Z^)o such that E[rj 9gG Ug 9 ] > 0. 
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In order to be self-contained we recall the proof of this proposition. 
Proof. For any x,e in G, for all n £ N, P^-almost everywhere on the event {X n = x}, 

Now using the independence of the random variables is(y) for different sites y, the terms 
depending on u(y) for y ^ x cancel in the previous ratio, and we get the result. □ 

The following result is an analogue of the strong Markov property for reinforced random 
walks. 

Proposition 2. Let X be a reinforced random walk with reinforcement function V , and 
let T be a stopping time with respect to the natural filtration of X. Assume T is almost 
surely finite. Then 

P V (X T+1 - X T = e | T T ) = V e (N(T)) P v - a.s. 
The proof is obtained in an obvious way, by considering the events {T = n}. 

3.2. A zero-one result 

The following zero-one result happens to be quite useful. 

Proposition 3. Let (n,...,ri) a finite (eventually empty) sequence of elements of R. Let 
S(n,...,ri) be the random set {x € G j 3n > 0, H(n,x) = (n, n)}. Then Sr ru ... jr A ^ either 
P^-almost surely empty or P^-almost surely infinite. 

Proof. Suppose that Sr ri ^ ri ) is not P^-almost surely empty. 
This implies that there exists a list of transitions 

L := (ri, ei,i, ei j2 , e ljkl , r 2 , e 2) i..., e 2 ,k 2 , »"3, , > r i) 

such that, for all m E {1, — 1}, 

km fe 

r m + '^2 e m,i = and, for all k € {1, . . . , k m - 1}, r m + ^ e m>i / 

i=l i=l 

and 

)]>0. 

fc=l m=l i=l 

(Note that if r m = 0, then k m = 0.) 

Let q := I + k\ + ... + £^_i be the length of the list L. 

For all € {1, . . . , g}, denote by y k the fe-th term of the list L. For all k € {1, . . . , q+ 1}, 
set x k := YjiZl Vi- (xi = 0.) 

Now consider the list (<?i, <7 2 , •••) of newly visited sites in their order of appearance. So 

5 = {gi,g 2 , •••}• By the assumption made in the introduction, S is almost surely infinite. 

To any integer n > 1, we associate a random integer i(n) defined by 

i(n) := mm{i > 1 | 3k G {1, . . . , q}, g { = g n + x fc }. 

We denote by fe(ra) the random smallest integer m > 1 such that <7j( n ) = g n +x m . (Obviously, 
/c(n) < q.) The sequence (<7i( n ))n>i takes infinitely many values (since the infinite set S is 
included in <7i( 2 ), •••} — x q }). Now, for any i > 1, we denote by Tj the hitting 
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time of gi by the walk. By definition of i(n), none of the sites g^ — xu n \ + Xj (1 < j < q) 
is visited by the trajectory before time T^ n y 

As a result, there exist infinitely many sites g'^g'^--- visited by the trajectory (enu- 
merated in their order of appearance) such that, for some k € {1, . . . , g}, if T' n denotes the 
hitting time of g' n , then none of the sites g' n — x k + Xj (1 < j < q) is visited by the trajectory 
before time T' n . We denote by k'{n) the least such integer k. T' n are clearly stopping times. 

For all n > 1, let tp n be the Bernoulli variable that equals 1 if and only if 

Xi . = f 9n - x k'(n) + Xi+k'(n) if * G {1, ■ ■ ■ , Q - k'(n)}, 

Tk+l \ g' n - x k'{n)+ Xi-k>(n)- q if i € {q + 1 - k'(n), . . . , 2q - k'(n) + 1}. 

(Otherwise, ^ n equals 0.) 

Observe that for all n, if ip n = 1, then Xt^ — x k rr n \ S <S'(ri,...,r;)- 

Due to the fact that the prescribed path the process has to follow during the time period 
[T^,T^ + 2q — k'(n) + 1] in order to satisfy ip n = 1 is a path that does not intersect the 
trajectory before T n , 

P(^n = 11-FtJ > E[J]>(x fc ,y fc )) 2 ] > EfQ u(x k ,y k )} 2 = 7 2 . 

k=l k=l 

Let £ n := ip(2q+i)n and r„ = T' {2q+1)n (n > 1). 

For all n, £i, ... , £ n are measurable with respect to JF Tn+1 . 
It is now obvious that for all n, k > 1 

P(e„+1 = • • • = £n+fc = 0) = 
P(£„+l = 0) • P(£„+2 = | £n+l = 0) • • • P(f n+fc = I £n+l = • • • = = 0) < (1 - 7*)* . 

Therefore, almost surely, infinitely many of the Vn's are equal to 1. This implies that 
S(n,...,ri) is almost surely infinite. □ 

Remark. If G equals the notion of convexity can be exploited in a proof slightly different 
from the one given above. 

We deduce that the sets R and T can be "viewed" on the trajectory: 
Corollary 1. The sets R and T are such that 

R == {g G E | St g \ is infinite} = {g 6 E \ SV ff ) ^ 0} and T = s {g € E 1 j SV ff ) = 0}. 

Let Sft denote the random set {x G G \ 3n > 0, N(n,x) = ft}. An easy corollary of the 
above proposition is the following analogous result concerning unordered histories. 

Proposition 3'. If n 6 (Z+)o, i/ien 5^ is either P^-almost surely empty or P^-almost 

surely infinite. 

We now distinguish a particular subset of (Z+)o, 

A/" poss := {n € (Z?) | 5g + P** - a.s.}. 

Loosely speaking, M VOS s is the set of "possible" unordered histories for sites that are 
presently occupied. An element ft = (n g ) ge G of (Z'f)o belongs to J\f poss if n g = whenever 
g $l R and if, moreover, it satisfies (any one of) the three following equivalent conditions: 

(a) P^(3n > 0|JV(n,0) = n) > 0; 

(b) the random set Sft is almost surely infinite; 
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(c) E[n refl ^i>o. 

Note that, by (c) and Proposition 1, the law of the process is determined by the restriction 
of the reinforcement function to the set M poss . 

4. What a single trajectory tells 
In the sequel we assume that the law of the process X is P M (or, equivalently, P v ). 

4.1. Determining the law of the walk 

As noted in the introduction, "straightforward" computation based on the frequencies 
of transitions from sites visited "many" times is not reliable. What we do here, instead, 
is collecting information on sites displaying some specified histories (or specified unordered 
histories). 

For any n G N poss , let T™ (i > 1) be the successive times where the unordered history of 
the currently occupied site is n: 

• if := M{k > | N(k) = n} 

• Vi > 0, T? +1 := M{k > if | N(k) = n} 

(We ignore the case (happening on the negligible event {Sft is finite}) where some T™ is 
infinite.) 

Proposition 4. The E-valued random variables A™ := X T a , 1 — X T n (i > 1, n G M poss ) 

i ' i 

are independent. Also, for all n G M poss , the random variables A" have the same law: 

Ve G E, P(Af = e) = V e (n). 

Proof. Let 0™ (i > 1, n G (Z2)o) be independent random variables such that for all e G E, 

P(Gf = e) = 14(n) 
Now we consider the process (Y n )n>o : 

where N y is to the process Y what N is to the process X, and 

r(n) := card{j G {0, • • • ,n}\ N y {j) = N y (n)}. 

P(Y n+1 = Y n + e\ a(Y k , k<n)) = P(0^ n) = e\a(Y k ,k< n)) 

= E Ez>0,me(ZG) l T(n)=l ^ y (n)=rn 1 ef=e I <^(Xk, k < «)] 

But on the event A[^^ n := {r(n) = l,N y {n) = m}, 0™ is independent of a(Yk,k < n). 
Thus, 

P(Y n+1 = y n + e | a(F fe , k <n)) = T,i>o.rh£(z?)n ^^Wi.ivJnWfiEl 1 



!>0,me(Z^) - L r(n)=i,iV; / (n)=m iJ L- L e™=eJ 
= Z)z>0,rne(Z$)o 1 r(n)=«,Afj / (n)=m^ e (™') 

= K(4(n)) 

Consequently, the two processes X and Y have the law. But A™ is to the process (X n ) n >o 
exactly what 0™ is to the process Y. The result follows. □ 
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We deduce from this proposition the following corollary, which describes the construction 
of the reinforcement function on Af poS s or , equivalently (by Proposition 1), of the annealed 
law: 

Corollary 2. If ft G N poss , then almost surely, for all e G E, 

IaS— P ~\~ •■• ~\~ 1a™ =e 

— ► Vel^j « s m —> co- 
rn 

4.2. The law of the environment 

The next result follows easily 

Theorem 1. (a) A single trajectory determines almost surely the moments of the form 
E^fz/^ 1 ■■■Vru v t\ f or a M r i)---) r fc S R, t G T, ni,..., nfe G Z+, e = or 1. Moreover, if 
these moments coincide for two distinct environment distributions, the induced RWRE have 
the same annealed law (and, consequently, two such environment distributions cannot be 
distinguished). 

(b) If card T = or 1, a single trajectory determines almost surely the distribution of 
the environment. 

Proof, (a) By Corollary 2, the restriction of V to M poss is almost surely determined by a 
single trajectory. So almost surely, for all n = (n g ) ge a G Nposs and for all e G G, a single 
trajectory determines the moments E[(rj gG f 9 9 ) • u e \. Moreover, all the other moments of 
the type E^fz^ 1 ■ ■ • v™£uf\ [r,j G R, t G T) are zero. Finally, the restriction of V to M poS s 
determines the law of the process. 

(b) If cardT = 0, we get all the moments of the z/ e 's. Since these variables are compactly 
supported variables, this determines all the finite dimensional marginals of the distribution 
of v} 

If cardT = 1, we get all the moments involving the v r 's. And if t is the unique element 
of T, then v t = 1 — Y2 t gr u r, an d we get all the moments of v. □ 

When card T > 2 the law of the environment can be determined in some cases, but not in 
general. (Accordingly, Corollary 1 of [2] should be amended; it holds in fact if card T < 1, 
but not in complete generality.) Here are two examples: 

Example 1. We consider the two following walks on Z: 

• The first one has a deterministic environment, and moves from x G Z with probability | 
to x + 1, with probability \ to rr + 2. 

• The environment in the second walk is coin-tossed independently at each site x G Z: with 
probability i, the transition probability to x + 1 is 1, and with probability i, the transition 
probability to x + 2 is 1. 

Here T = {1,2}, and, obviously, the two walks have the same law. 
Example 2. Again, G = Z; for any x G Z, with probability i the transition probability 
from x to x is equal to ^, and the transition probability from x to x + 1 is also equal to ^; 

"*"We recall a standard fact: if Ui, . . . , Ui are positive random variables such that Ui + ■ • ■ + Ui 6 [0, 1] 
almost surely then, for Lebesgue-almost all (ai, . . . , ai) £ M , 

P(J7i < oi, • ■ • , Ui < ai) = lim V , , nl . , E[(1 - Ui (70 fe ° ■ U? 1 ■ ■ ■ U? 1 ]. 

n—roo * — ' fcn! • • • Kll 

kg fc,>0 

fcoH hfcj=n 
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and, with probability |, the transition probability from x to x + 2 is equal to 1. In this case, 
T = {1,2}, but the distribution of the environment is almost surely completely determined 
by the single trajectory we observe. 

We can only see O-transitions (from a site x to itself), 1-transitions {x — ► x + 1) and 
2-transitions [x — ► x + 2). So fx, which is the law of ^(0), satisfies /z(z/o + v\ + v-i = 1) = 1. 
The fact that a 0-transition is never followed by a 2-transition tells us that if v<i > 0, then 
vq = 0. Statistics on sites from which there are O-transitions or 1-transitions reveals that 
the (conditional) distribution of the number of O-transitions from such a site is geometric. 
But a geometric distribution cannot be a nondegenerate convex combination of geometric 
distributions, and the conditional number of O-transitions from a visited site x for which 
u(x, 0) is given (and is in ]0, 1[) has a geometric distribution. We deduce that there is exactly 
one value a (= ^) such that almost surely, for all x £ Z, if v(x, 0) ^ 0, then v(x, 0) = a. And 
a simple computation shows that the proportion of visited sites from which a 0-transition is 
possible but does not take place fits exactly with the proportion of visited sites from which 
the first (and unique) transition is a 1-transition; so n{v\ > and V2 > 0) = 0. We deduce 
that \x is a convex combination of /i 1 and /i 2 , where /i 1 (^o = v \ = 5) = 1 an d A* 2 (^2 = 1) = 1. 
And the coefficients (both equal 1 ;) in this convex combination are easily determined. 

4.3. Sampling iid trajectories 

Here we show that infinitely many independent trajectories drawn under can be 
generated by concatenating steps of the single trajectory at our disposal. 

For simplicity, we describe the construction of just two trajectories, A 1 and X 2 . This is 
enough, since we can do the following: once X 1 and X 2 are constructed, leave X 1 as it is 
and extract two trajectories, say X 3 and X 4 , out of X 2 exactly the way we extracted X 1 and 
X 2 out of X, then, out of A 4 , get X 5 and A 6 , and so on; and the family (A 1 , X 3 , X 5 , ...) 
is exactly what we want. 

All we do is construct X 1 (resp. X 2 ) using in the "natural" way the transitions A™ with 
i odd (resp. even). More formally, X 1 and X 2 are defined as follows: 

yl _ n Y 1 yl _i_ a^V) 

y\. — U, ^n+l n ' 2r 1 (n) — 1' 

where N 1 is to the process A 1 what N is to the process A, and 

T l {n) :=card{j G {0, • • • ,n}\N 1 (j) = N l {n)}- 

and, similarly, 

v-2 _ n y-2 _ „ 2 , A ^V 2 (n) 
A - U > A n+1 - A n + ^2r 2 (n)' 

where N 2 is to the process A 2 what A is to the process A, and 

r 2 (n) := card{j G {0, • • • ,n} \ N 2 (j) = N 2 (n)}. 
The validity of this construction is an immediate consequence of Proposition 4. 

5. Remarks 

5.1. Infinitude of S. The "infinitude assumption" (according to which the random set S 
of sites visited by A is almost surely an infinite set) is made in order to avoid discussing 
rather trivial cases. (If S is finite, then precise knowledge of the environment at some sites 
is almost surely available; but, unless some specific conditions are imposed on /x, complete 
knowledge of fi is out of reach.) 
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Proceeding along the general lines of the proof of Proposition 3, one can show that the 
random set S is either almost surely infinite or almost surely finite. 

5.2. Countability of G. If we abandon the countability assumption on G, the set E = 
{x 6 G | P{X\ = x) > 0} remains countable, and our sampling procedure works just as 
well. Consequently, P' 4 can be determined in non-pathological situations (and in particular 
if Gis the real line). 2 (This can also be seen by introducing a new kind of reinforcement 
function which, to a given unordered history at a site, associates the probability that the 
next transition falls into some measurable set.) If there is no countable set D C G such 
that X\ 6 D almost surely, then li cannot be determined (as one can see after studying 
the first example of section 4). (If there is some countable set D C G such that X\ € D 
almost surely then, almost surely, each X n is in the subgroup generated by D; and since 
this subgroup is countable, all we did is adaptable in an obvious way.) 

5.3. Structure of G. The choice of dealing with RWRE on a group captures, we think, the 
essence of the matter. We could have restricted ourselves to groups like 1 d (or some other 
subgroups of M. d ) without a substantial gain in simplicity. A group structure is suitable 
(though not absolutely indispensable) if the notion of iid random environment is to make 
sense. We could have dealt with RWRE on homogenous spaces, or on some trees, without 
gaining new insight. 

5.4. Assumptions on the environment. The requirement that environment at sites are 
iid can be loosened in various ways. 

Example. G = Z; there are two laws for the environment at sites, say /io and m; environments 
at sites are independent; and v(n) is governed by fiQ if n is even, by fi± if n is odd. 

Example. G = Z; there are laws liq, . .for the environment at a site; K is a random 
variable taking values in the set {2, 3, . . . }; and, conditioned on K, the v{n) are independent 
and, for all n, v{n) is governed by ^ n ( mo dK)- 

Example. G = Z; the couples {v(2n),v{2n + 1)) (n € Z) are iid, but z^(0) and z^(l) are not 
independent. 

5.5. Other reinforcements. Our results on the determination of the law of the process 
and on sampling iid trajectories can be extended to various other edge-oriented reinforced 
random walks that do not correspond to a random environment. Whenever an appropriate 
analogue of Proposition 3 is valid, things work quite well. (A sufficient condition is strict 
positivity of the restriction of V r to (Z^)o for all r E R.) 
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